Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams

Authors

  • Anindya Roy
  • Sébastien Marcel
Abstract

Person identification using audio or visual biometrics is a well-studied problem in pattern recognition. In the usual scenario, both training and testing are done on the same modality. However, there can be situations where this condition does not hold, i.e. training and testing have to be done on different modalities. This could arise, for example, in covert surveillance. Is there any person-specific information common to both the audio and visual (video-only) modalities which could be exploited to identify a person in such a constrained situation? In this work, we investigate this question in a principled way and propose a framework which can perform this task consistently better than chance, suggesting that such crossmodal biometric information exists.

Similar resources

The TUM Gait from Audio, Image and Depth (GAID) database: Multimodal recognition of subjects and traits

Recognizing people by the way they walk – also known as gait recognition – has been studied extensively in the recent past. Recent gait recognition methods solely focus on data extracted from an RGB video stream. With this work, we provide a means for multimodal gait recognition, by introducing the freely available TUM Gait from Audio, Image and Depth (GAID) database. This database simultaneous...

Identity Authentication based on Audio Visual Biometrics: A Survey

Biometric authentication is an emerging technology that utilizes biometric data for the purpose of person identification or recognition in security applications. A number of biometrics can be used in a person authentication system. Among the widely used biometrics, voice and face traits are the most promising for pervasive application in everyday life, because they can be easily obtained using unobtrus...

Temporal Structure and Complexity Affect Audio-Visual Correspondence Detection

Synchrony between events in different senses has long been considered the critical temporal cue for multisensory integration. Here, using rapid streams of auditory and visual events, we demonstrate how humans can use temporal structure (rather than mere temporal coincidence) to detect multisensory relatedness. We find psychophysically that participants can detect matching auditory and visual st...

The effect of attention on the illusory capture of motion in bimodal stimuli.

A large body of work now demonstrates the interaction between different sensory modalities when they are integrated into a single coherent percept. However, it is not yet clear whether attention plays a critical role in such crossmodal interactions. We investigated the effect of attention on the crossmodal integration of apparent motion signals using the crossmodal dynamic capture parad...

Audiovisual Information Fusion in Human-Computer Interfaces and Intelligent Environments: A Survey

Microphones and cameras have been extensively used to observe and detect human activity and to facilitate natural modes of interaction between humans and intelligent systems. The human brain processes the audio and video modalities, extracting complementary and robust information from them. Intelligent systems with audio-visual sensors should be capable of achieving similar goals. The audio-visual i...


Publication date: 2010